Musical Acoustics and Speech Communication: Musical Pitch Tracking and Sound Source Separation Leading to Automatic Music Transcription I
نویسنده
چکیده
A powerful pitch estimation algorithm called SWIPE has been developed for processing speech and music. SWIPE is shown to outperform existing algorithms on several publicly available speech and musical instrument databases, and a disordered speech database, reducing the gross error rate by 40%, relative to the best competing algorithm. In short, SWIPE estimates the pitch as the fundamental frequency of a sawtooth waveform, whose spectrum best matches the spectrum of the input signal. The short-time Fourier transform of the sawtooth waveform provides an extension to older frequency-based, sieve-type estimation algorithms by providing smooth peaks with decaying amplitudes to correlate with the fundamental frequency if present and its harmonics. An improvement on the algorithm is achieved by using only the first and prime harmonics, which significantly reduces subharmonic errors commonly found in other pitch estimation algorithms.
منابع مشابه
Musical Acoustics and Speech Communication: Musical Pitch Tracking and Sound Source Separation Leading to Automatic Music Transcription II
This paper describes research aimed at building ‘‘active music listening interfaces’’ to demonstrate the importance of music understanding technologies, including sound source separation and F0 estimation, and the benefit they offer to end users. Active music listening is a way of listening to music through active interactions. Given polyphonic sound mixtures taken from available music recordin...
متن کاملمطالعه درجات اصلی گام موسیقی ایرانی از روی طیف نتهای گام
In this paper we have extracted the notes of Iranian scale from the traditional music played by the great musician Shahnazi on the TAR. Then, by analyzing the spectrum of the notes and by using our special averaging we have found the pitch attributed to the components’ frequency and found the interval between the notes. The results are in comple agreement with Pythagorean scale. Pitch is a su...
متن کاملVoicing Determination of Speech with an Extension Toward Music Signals
This chapter reviews selected methods for pitch determination of speech and music signals. As both these signals are time variant we first define what is subsumed under the term pitch. Then we subdivide pitch determination algorithms (PDAs) into short-term analysis algorithms, which apply some spectral transform and derive pitch from a frequency or lag domain representation, and time-domain alg...
متن کاملBayesian Harmonic Models for Musical Signal Analysis
This paper is concerned with the Bayesian analysis of musical signals. The ultimate aim is to use Bayesian hierarchical structures in order to infer quantities at the highest level, including such things as musical pitch, dynamics, timbre, instrument identity, etc. Analysis of real musical signals is complicated by many things, including the presence of transient sounds, noises and the complex ...
متن کاملWhat Constitutes a Phrase in Sound-Based Music? A Mixed-Methods Investigation of Perception and Acoustics
Phrasing facilitates the organization of auditory information and is central to speech and music. Not surprisingly, aspects of changing intensity, rhythm, and pitch are key determinants of musical phrases and their boundaries in instrumental note-based music. Different kinds of speech (such as tone- vs. stress-languages) share these features in different proportions and form an instructive comp...
متن کامل